2/26 





Host CPU 
102 




Main Memory 
106 








y- 104 













Graphics 
Accelerator/ 
System 
112 










Display Device 
84 





Fig. 2 



3/26 



112 



Boot 
PROM 
30 



32 



L 



104 



Media Processor 
14 



I 



Hardware Accelerator 
18 



Frame Buffer 
22 



Output Processor 
24 



I 



DAC 26 



f 



I 



DRDRAM 
16 



Texture 
Buffer 
20 



Video Encoder 28 



Fig. 3 



4/26 




5/26 




CD »— 
.CD 



^ evil 




6/26 








t 




t 






dbsel 




dbsel 


t 


+ 1 




1 1 



o 

CM 
CO 



2 

I 

Q 

CO 



E 







1 












GO 


CMI 




cnjI 1 


E 




5 




Li. 





7/26 



Portion of Sample Space 



Y 
A 



2x2 Tile 



° °o ° o 

°o ° ° 
° o 

o o 
o o o 


0 \ 0 0 
0 \ 0 0 


O O O 
O O O 

S O O 
O ° O 
n 00 O 


0 00 

°o 0 0 

0 0 0 \ 
0 0 0 

0 0 


00 0 

0 00 

0 0 0 

0 0 0 0 

0 ° 0 n 


O O O 

00 0 
0 0 0 

0 0 0 0 


00 0 
0 ° o° 

0 0 0 

0 /* • 


O O O 
0 O O 

00 ° 

O O 
w O O 


0 0 0 

OOO 

0 00 

00 0 


0 0 Q ° 0 
0 0 0 

0 0 
o 0 0 


0 0 0 
0 0 0 

0 0 0 

0 0 0 0. 




..^ 

• •• • 

• • • 


00 0 
S--o 0 

• • • 


0 0 o q 

0 0 0 

0 o^— — 
V^^^° 0 


0 0 y£ 




• • « 




-0-^ O q 


0 ° n/ m 

0 O X • 






-0^0 0 
0 0 o 


°o 0 0 ° 


0 0 O 0 


OOO 

0 0 0 o 


0 0 o o 

0 0 0 0 


OOO 


°o°° 

0 0 0 

0 o° 0 0 



Typical Bin (i.e. Fragment) 
containing multiple samples 
(denoted as small circles) 



Triangle 



Fig. 7 



• 72 "V 


Bin ^ 
• • 


Bin • 
• • 

• • 

> • 


Bin ' 

• * B> 

• 70 .v 


Bin | 


> ^ Bin • 

/ # 
/ • 


• 

• • 


< 

• • 


M — 

Bin 

• • 
• 

• • 



Fig. 8 



8/26 



CD 



< 

CN 
CM 

4 

CD 
JD 

E 

03 
CO 



y 



CO 

CD d) CN 

*5 oco 
Q 



o 
to 

CO 



03 
^— » 
05 

Q 



CD 

E 
o 

CD 

CD 









6 


6 


6 


LU 


LU 


LU 


o 


i— " 
o 


O 


o 


o 


O 


O 


O 


O 


N 




N 










xf 


X" 




CN 


CO 




=«: 




X 


X 






<D 
















2 


5 


5 



CO 

c5 
c 

"2 
o 
o 
O 

_CD 
Q_ 

E 

05 
CO 



E °° 

0) 



(0 

o 

Q_ 
_CD 
CL 

E 

05 
CO 





a 


8 




« 










d 


#(i-i) 


rsT 
co~ 
cd" 


N 

co~ 
cd" 




rsT 
do" 
CD" 


c 
CO 

z 


N 
CO 

CD" 


N 
03 

CD" 




isf 
co" 
CD" 


Bin 


of 

CO 


of 

CN 
CO 




of 

CO 


c 
CO 


of 

CO 


of 

CN 
CO 




of 

CO 


■ 


* 
• 










d 




d 


d 




53 


CM 
*t 


N 

CD 


do" 




isf 
co~ 


+ 


NJ" 
CD" 


N 
CO 




isf 

CD 


C 


CD" 


o" 




CD" 




CD" 


CD" 




CD" 


CD 


of 


of 




of 




of 


of 




of 




CO 


c\i 
CO 




z: 

CO 


CD 


CO 


cvi 
CO 




CO 










d 




d 


d 




d 




N 


rsT 




rsj" 




NJ 


N 






Bin #1 


CD 
CD" 

of 


CD 

CD" 
of 




cd" 
CD" 
of 


c 
CD 


cd" 
CD" 
of 


co~ 

CD" 

of 




CD" 

CD" 
of 




CO 


cxi 
CO 




z 

CO 




CO 


csi 
CO 




CO 



CN 
X 
CN 



CD 
LU 

co 
E 
CD 



■N 

y 



CO 

(D 
-t — • 

03 

o 
o 
O 
_cd 

Q_ 

E 

05 
CO 





/ 




/ 




s 







co 




— CO 




<D CD 




X O 




ol 2 






CM 




Q. 05 




E5 




CD O 
CO 05 




O 





LU 



Q. 

3 JO 
O <^ 

! I 

m 8 

CD "5 

E > 

CD 



CD 



_co 

o 



>- 

■o 


AP 


AP 




x 

"D 


x 

"D 




SN: 


SN: 








> 

•o 


AP 


> 

"O 


YD 

Ar 


yp 


X~ 

"D 


CN 
CO 


<N 
CO 


c\i 
CO 


>- 

"O 


>- 




xT 

X3 


xf 

"O 


xf 


CO 


CO 


CO 


Regular 


Perturbed 


Stochastic 



9/26 



0.5 




1 


I 


-1 


1 



Fig. 10A 




Fig. 10B 



Filter base 

Fig. 10C 



Area to be 
considered 











___| 






I 


i-T 






LZ 




j 














-j 



Actual filter 



Fig. 10D 



10/26 




Fig. 12A 



Fig. 12B 



Fig. 120 



1 











Filter base 



Fig. 13A 



Fig. 13B 



Fig. 130 



11/26 



123456789 10 




s s \ \ 

/ / \ \ 

/ / \ \ 

I ( ■ - 1 I 

\ \ / / 

X V ' ' 

Pixels in same scanline 



A bin of samples 



Filter boundary 




Filter center 



Fig. 14 



12/26 




Fig. 15 



13/26 




Zoom factor = 2. delta = 0.5 
filter radius = 1 




Fig. 17 



dx 2 



dy 2 



Squareroot 
table 



A 



Filter table 



Weight 



Radius 



Fig. 18 



14/26 



Sample Position 




Generation 




422 




- \ 




x 2 + y 2 = d 2 




424 



I 



Squareroot table 
426 



A 



428 



Weight table 



430 



Weight queue 



Sample 
Memory 
22A 



innrv 



Cache 8x5x1 6xRGB 402 





Fig. 19 



15/26 



I 2 .3 4 5 6 7 8 9 10 

-^"^s -^X> -^**V. ^"v. \ 

/ /\/\/\/\/\/\/\/\ \ \ 



v 

A- 








XI 


X^ 




X 




"V 1 




V] 




0 


0 






0 




Hi 




















y 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 



























































































































/ / \ \ 
/ / \ \ 
/ / \ \ 

' ' p ■ I I 

\ 1 

\ \ 
\ \ 



/ / 

/ / 



Pixels in same scanline 



A bin of samples 



Filter boundary 




Filter center 



Fig. 20 



16/26 




A bin °f samples 




Fig. 21 



17/26 




i I 1 I i I i i 



Fig. 22 



Zoom factor = 2, delta = 0.5 
filter radius = 1 




Fig. 23 



T-T-T--T--I , 




Fig. 24 



18/26 





Fig. 25 



19/26 



Block boundary 




Douhdgry 



Fig. 26 



Pixel 



Pixel 




0 read tile 0 

1 read half tile 1 

2 read tile 2 

3 read half tile 3 

4 filter pixel. 



Fig. 27 



Pixel M 



P'xel H 




Pixel 



i+1 

0 read tile 0 

1 read half tile 1 

2 filter pixel M & pixel H 

3 read tile 2 

4 read half tile 3 

5 filter pixel, & pixelj 

6 filter pixel j+1 & pixel j+1 



Pixel 



'j-i 



Fig. 28 



20/26 



Pixel 



Pixel 



Pixel 




Pixel j+3 

0 read tile 0 

1 read half tile 1 

2 read tile 2 

3 read half tile 3 

4 filter pixelj & pixelj 

5 filter pixel i+1 & pixel j+1 

6 filter pixel j+2 & pixel j+2 

7 filter pixel i+3 & pixel j+3 



Pixel 



'j+3 



Fig. 29 



binlnlv [1:0] 
0 

For Non-Buddy Mode: 1 

2 
3 



sample [1:0] 
0 12 3 



abed 
c d a b 
d c b a 
bade 



Where a, b, c, d identify different 
interleave bus to 3DRAM 



binlnlv [1:0] 
0 

For Buddy Mode: -] 

2 
3 



sample [2:0] 
0 1 2 3 4 5 6 7 



aabbcedd 
ceddaabb 
ddcebbaa 
bbaaddec 



Where a, b, c, d identify different 
interleave bus to 3DRAM 



Fig. 30 



21/26 



A full tile from SW 



SD16 Mapping: 



binlnlv [1:0] 
0 
1 

2 
3 



Sample 
0 1 2 3 4 5 6 7 



89a bcdef 



cTdYa 



I 



PI 


a 


b 


c 


d 


a 


fb) 


c 


(51 


b 


c 


d 


a 


b 


c 


d 


a 


b 


a 


d 


c 


b 


a 


d 


c 


b 


a 


c 


b 


a 


d 


c 


b 


a 

<^ 


d 


c 



16 sample tiles from SG to Fragment Pipe 



A full tile from SW 



SD16B Mapping: 



binlnlv [1:0] 
0 
1 

2 
3 



Sample 
0 1 2 3 4 5 



6789abcdef 



a 


a 


b 


b] 


fc 


c 


d 


Pi 


a 


a 


b 


b 


C 


c 


d 


PI 


c 


c 


d 


d 


a 


a 


b 


b 


c 


c 


d 


d 


a 


a 


b 


b 


d 


d 


c 


c 


b 


b 


a 


a 


d 


d 


c 


c 


b 


b 


a 


a 


b 


b 


a 


a 


d 


d 


c 


c 


b 

<-> 


b 


a 
<^ 


a 


d 


d 


c 


c 



i 



16 sample tiles from SG to Fragment Pipe 



Fig. 31 



22/26 







or 


• 


• 




• 







A horizontally half tile from SW 



SD16 Mapping: 



binlnlv [1:0] 
0 
1 
2 
3 



i 



Sample 

01 23456789abcdef 




c b a 

badcbadc badcbadc 



8 sample tiles from SG to Fragment Pipe 







or 


• 


• 


• 


• 







A horizontally half tile from SW 



binlnlv [1:0] 

SD16B Mapping: 0 

1 
2 
3 



Sample 

01 2 34 5 6 7 8 9 a bcdef 




8 sample tiles from SG to Fragment Pipe 



Fig. 32 



23/26 




Fig. 34 



24/26 



sw 



3DRAM 



FCQ 



c_rd_*(tx,ty. 

dO: {30'bx, 36'bx}) 



c_wr_*(tx,ty. 
dO: {66'bx}) 



c_wr_*(tx,ty. 
dO: {66'bx}) 



SG/SE i TE 

c_rd_*(tx,ty. 
dO: {30'bx, 36'bx}) 

c_rd_*(tx,ty. 
dO: {30'bx, 6'bx}) 

c_rd_*(tx,ty. 
dO: {30'bx, 36'bx}) 

c_rd_*(3DRAM Addresses. 
dO: {30'bx. 14'bx. 
fmt3[1:0].fmt2[1:0]. 14'bx. 
fmt1[1:0].fmt0[1:0]}) 



c_rd(48'b data) 



c_rd_*: 

1) *means either with / out row 

2) All c_rd_* can be replaced by 
rd accum *orrd accum2* 



r • 

I 

I 

T 



FCQ 



ll 



FP 



II 

FBA 

!! 

FBI 

ii 



1^ 
t 

FRB 
I 
I 
I 

t 

PXM 

I 
I 

t 

• PX 



c_rd_*(14'bx,fmt3[1:0]. 
fmt2[1:0]. 14'bx. 
fmt1[1:0].fmt0[1:0]) 
c_rd(48'b data) 



c_rd(48'b data) 



c_wr_*(bc,ty. 
dO:{valid}) 

c_wr_*(tx,ty. 
dO:{valid}) 



c_wr_*(3DRAM Addresses. 
dO: {valid}) 



Read Pass Write Pass 



3DRAM 



I 



Fig. 35 



25/26 



SW 



3DRAM 



FCQ 



rd_samples_*(tx,ty, 
dO: {60'bx, ff, 4'bx}) 



TE 



11 



ft 

SG/SE 

(Loop Through all samples 
rd_samples_*(tx,ty, bin .sample 
dO: {60'bx, ff. 4'bx'}) 

rd_sampres_*(tx,ty, bin.sample FP 
dO: {60'bx, ff. 4'bx'}) 

rd_samples_*(tx,ty, bin.sample 
dO: {60'bx, ff. 4'bx'}) 

rd_*(3DRAM Addresses. 
d0:{30'bx. y[11:0]. 2'bx. 
fmt3[1:0].fmt2[1:0]. 
x[11:0].ff 

fmt1[1:0].fmt0[1:0]}) 



i 
i 
i 
i 

PX 



^ rd_samples(y[1 1 :0], 
FRB 2'bx.fmt3[1:0],fmt2[1:0], 
i x[11:0],ff 
| fmt1[1:0],fmt0[1:0]) 

t 

PXM 

I 
l 



rd_sample_*: 

1) "means either with / out row 



FCQ 



II 

FBA 

II 

FBI 
i 

i 
i 

t 



3DRAM 



Read Pass Write Pass 



i 
i 
i 
i 

♦ 



j 



Fig. 36 



26/26 



sw 



3DRAM 



FCQ 



c_rd_filter*(tx,ty, 
dO:{30'bx, 
12'bxyfrac[5:0], 
12'bx, xfrac[5:0]}) 



c_wr_*(tx,ty, 
d0:{66'bx}) 



▼ 

SG/SE ~ 
c_rd_filter*(tx,ty, 
d0:{30'bx, 
12'bx yfrac[5:0], 
12'bx, xfrac[5:0]}) 



c_wr_*(tx,ty, 
d0:{66'bx}) 



TE 



c_rd_*(48'b data) 



— i — 

t 

FRB 

| c_rd_filter(48'b data) 
PXM 



c_rd_filter( 

y[11:0],yfrac[5:0], 

x[11:0],xfrac[5:0]) 



c_rd_filter*(tx,ty, 
d0:{30'bx, 
12'bx yfrac[5:0], 
12'bx, xfrac[5:0]}) FP 

c_rd_filter*(tx,ty, 
d0:{30'bx, 
12'bx yfrac[5:0], 
12'bx, xfrac[5:0]}) 

FBA 

c_rd_*(3DRAM Addresses=xx, J 
d0:{30'bx.y[11:0] 6 
yfrac[5:0],x[11:0],xfrac[5:0], 



J 

i uA 

11 



t 

PX 



c_rd_filter(48'b data) 



c_wr_*(tx,ty, 
dO:{valid}) 



c_wr_*(tx,ty, 
dO:{valid}) 



c_wr_*(3DRAM Addresses, 
dO:{valid}) 



c_rd_*: 

1) *means either with/out row 

2) All c_rd_* can be replaced by 
rd_accum _*or rd accum2 * 



r - 

I 

I 

▼ 



FCQ 



FBI 




i 








i 




i 






r 


3DRAM 



Read Pass Write Pass 



j 



Fig. 37 



